Detecting Hate Speech in Social Media
نویسندگان
چکیده
In this paper we examine methods to detect hate speech in social media, while distinguishing this from general profanity. We aim to establish lexical baselines for this task by applying supervised classification methods using a recently released dataset annotated for this purpose. As features, our system uses character n-grams, word n-grams and word skip-grams. We obtain results of 78% accuracy in identifying posts across three classes. Results demonstrate that the main challenge lies in discriminating profanity and hate speech from each other. A number of directions for future work are discussed.
منابع مشابه
Analyzing the Targets of Hate in Online Social Media
Social media systems allow Internet users a congenial platform to freely express their thoughts and opinions. Although this property represents incredible and unique communication opportunities, it also brings along important challenges. Online hate speech is an archetypal example of such challenges. Despite its magnitude and scale, there is a significant gap in understanding the nature of hate...
متن کاملResponding to Hate Speech on Social Media: A Class Leads a Student Movement
In the Spring of 2012, fans of the Gonzaga University basketball team used hate speech on social media site Twitter to express their frustration at losing a game to the Brigham Young University team. In response, the students in the Hate Studies in Business course started a student-led movement to “Take the Hate Out of Hoops.” The students applied their lessons in virtue ethics and leveraged th...
متن کاملA Survey on Hate Speech Detection using Natural Language Processing
This paper presents a survey on hate speech detection. Given the steadily growing body of social media content, the amount of online hate speech is also increasing. Due to the massive scale of the web, methods that automatically detect hate speech are required. Our survey describes key areas that have been explored to automatically recognize these types of utterances using natural language proc...
متن کاملHateful Symbols or Hateful People? Predictive Features for Hate Speech Detection on Twitter
Hate speech in the form of racist and sexist remarks are a common occurrence on social media. For that reason, many social media services address the problem of identifying hate speech, but the definition of hate speech varies markedly and is largely a manual effort (BBC, 2015; Lomas, 2015). We provide a list of criteria founded in critical race theory, and use them to annotate a publicly avail...
متن کاملSurfacing contextual hate speech words within social media
Social media platforms have recently seen an increase in the occurrence of hate speech discourse which has led to calls for improved detection methods. Most of these rely on annotated data, keywords, and a classification technique. While this approach provides good coverage, it can fall short when dealing with new terms produced by online extremist communities which act as original sources of w...
متن کامل